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Abstract 



An identifying code is a subset of vertices of a graph such that each vertex is uniquely determined 
^ ■ by its neighbourhood within the identifying code. If 7™ (G) denotes the minimum size of an identifying 

^0 ' code of a graph G, it was conjectured by F. Foucaud, R. Klasing, A. Kosowski and A. Raspaud that 

there exists a constant c such that if a connected graph G with n vertices and maximum degree d 
■ admits an identifying code, then 7™ (G) < n — ^ + c. We use probabihstic tools to show that for 

any d > 3, 7^°(G) < n — holds for a large class of graphs containing, among others, all regular 
graphs and all graphs of bounded clique number. This settles the conjecture (up to constants) for 
these classes of graphs. In the general case, we prove 7™(G) < n — gp^y- In a second part, we prove 

that in any graph G of minimum degree S and girth at least 5, 7™(G) < (1 + os{l)) '^^§§-^n. Using 
the former result, we give sharp estimates for the size of the minimum identifying code of random 
^ , d-regular graphs, which is about ^^^n. 

^ ! 1 Introduction 

> 



Given a graph G, an identifying code C is a dominating set such that for any two vertices, their neigh- 
bourhoods within C are nonempty and distinct. This property can be used to distinguish all vertices 



, of the graph from each other. Identifying codes have found applications to various fields since the in- 



troduction of this concept in [19]. These applications include the location of threats in facilities using 
I sensors [55], error-detection schemes [IH] and routing [3T] in networks, as well as the structural analysis 

of RNA proteins [TBI (under the denomination of differentiating-dominating sets). 

In this paper, we address the question of lower and upper bounds on the size of an identifying code, 
thus extending earlier works on such questions (see e.g. [23l H] [TH [Til [12] ) . We focus on degree-related 
graph parameters such as the minimum and maximum degree, and also study the case of regular graphs. 
An important part of the paper is devoted to giving the best possible upper bound for the size of an 
I identifying code depending on the order and the maximum degree of the graph, a question raised in [10] . 

We also give improved bounds for graphs of large girth and study identifying codes in random regular 
graphs. The main tools used herein are probabilistic. 

We begin by giving our notations and defining the concepts used throughout the paper. 
As graphs and unless specifically mentioned, we understand simple, undirected and finite graphs. The 
vertex set of a graph G is denoted by V{G) and its edge set E{G). We also denote its order by n = |y(G')|. 
The maximum degree of G will be denoted by d = d(G), its minimum degree, hy 6 = 6{G), and its average 
degree, by d = d{G). We denote hy u ^ v, the adjacency between two vertices u and w, and by m 7^ w, 
their non- adjacency. The set of neighbours of some vertex v is called its open neighbourhood and denoted 
by N(y), whereas the set of its neighbours and itself is called its closed neighbourhood and denoted by 
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N[v]. If two distinct vertices u,v are such that N[u] — N[v], they are called twins. If N{u) — N{v) but 
u V, u and v are called false twins. The symmetric difference between two sets A and B is denoted by 
AAB. We use log(x) to denote the natural logarithm of x. We also make use of the standard asymptotic 
notations o, O, Q, fl and lu. Usually the asymptotics are taken either on variables d, 5 or n. We use the 
notations Ox{l) and Oa;(l) to stress the fact that the asymptotic is taken on variable x. If we write o(l) 
or 0(1), then by convention the asymptotic is taken only on n, the number of vertices. 

Given a graph G and a subset C of vertices of G, C is called a dominating set if each vertex of V{G)\C 
has at least one neighbour in C. Set C is called a separating set of G if for each pair u, v of vertices of G, 
7V[u] n C 7^ iV[t)] n C (equivalently, {N[u]AN[v]) n C ^ 0). We have the following definition: 

Definition. Given a graph G, a subset of vertices ofV{G) which is both a dominating set and a separating 
set is called an identifying code of G. 

First of all it must be stressed that not every graph can have an identifying code. Observe that a 
graph containing twin vertices does not admit any separating set or identifying code. In fact a graph 
admits an identifying code if and only if it is twin-free, i.e. it has no pair of twins (one can see that 
if G is twin-free, V{G) is an identifying code of G). Note that if for three distinct vertices u,v,w of a 
twin-free graph G, A^[M]AiV['i;] = {w}, then w belongs to any identifying code of G. In this case we say 
that u) is uv-forced, or simply forced. Observe that any isolated vertex must belong to any identifying 
code for the reason that it must be dominated. For example, an edgeless graph needs all the vertices in 
any identifying code. Hence, the bounds of this paper only hold for graphs with few isolated vertices. 
In order to shorten the statements of our results, we assume that all considered graphs do not have any 
isolated vertices. 

For a given graph, the problem of finding a minimum identifying code is known to be NP-hard [7] , 
even in graphs having small maximum degree and high girth (to be precise, in planar graphs of maximum 
degree 4 having arbitrarily large girth [2] and planar graphs of maximum degree 3 and girth at least 9 [3] ) . 

The minimum size of an identifying code in a graph G, denoted 7™(G), is the identifying code number 
of G. It is known that for a twin-free graph G on n vertices having at least one edge, we have: 

riog2(n+l)l <7^°(G)<n-l 

The lower bound is proved in 'W and the upper bound, in |14] . Both bounds are tight and all graphs 
reaching these two bounds have been classified (see [23] for the lower bound and [TT] for the upper bound). 

When considering graphs of given maximum degree d, it was shown in |19j that the lower bound can 
be improved to 7™ (G) > . This bound is tight and a classification of all graphs reaching it has been 
proposed in |10] . For any d, these graphs include some regular graphs and graphs of arbitrarily large 
girth. 

It was conjectured in [12] that the following upper bound holds. 

Conjecture 1 f[12)). There exists a constant c such that for any nontrivial connected twin-free graph G 
of maximum degree d, 7™(G) < n ~ ^ + c. 

Graphs of maximum degree d such that 7™(G) = n — are known (e.g. the complete bipartite graph 
Kd^d and richer classes of graphs described in Section [B]). Therefore if Conjecture [Tj holds, there would 
exist a constant c such that for any twin-free graph G on n vertices and of maximum degree d we would 
have < 7™(G) < n — ^ + c, with both bounds being tight. 

Note that Conjecture [T] holds for graphs of maximum degree 2 (see [15]). It was shown in [11] that 
7™(G) < 71 — Q(di) 1 and < n — opjy when G has no forced vertices (in particular, this is true 

when G is regular). It is also known that the conjecture holds in an asymptotic way if G is triangle- free: 
then,f°(G)<n-,,(,^ [12, 

Identifying codes have been previously studied in two models of random graphs, that is the classic 
random graph model [13^ and the model of random geometric graphs |25j . To our knowledge random 
regular graphs have not been studied in the context of identifying codes. 

In this paper, we further study Conjecture [T] and prove that it is tight (up to constants) for large 
enough values of d and for a large class of graphs, including regular graphs and graphs of bounded clique 
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number f Corollaries [51 and [T^ . In the general case, we prove that 7^(0) < n — Q^^y (Corollary [TU)) . 
These results improve the known bounds given in and support Conjecture [TJ Moreover, we show that 
the much improved upper bound of 7™(G) < (1 + 0^(1)) holds for graphs having girth at least 5 

and minimum degree 6 (Theorem [TU). This bound is used to give an asymptotically tight bound of about 
i^fii^n for the identifying code number of almost all random d-regular graphs (Corollary [52]). 

We summarize our results for the special case of regular graphs in Table [T] and compare them to the 
bound for the dominating set problem (the table contains references for both the bound and its tightness) . 
All bounds are asymptotically tight. We note that identifying codes behave far from dominating sets in 
general, as shown by the first lines of the table: there are regular graphs having much larger identifying 
code number than domination number. However, for larger girth and for almost all regular graphs, the 
bounds for the two problems coincide asymptotically, as shown by the last lines of the table. 





Identifying codes 


Dominating sets 


in general 


" 103d 

Thm. [71 Constr. [24l 


loe; d 

~ — 

lu, m 


girth 4 


^ n 

<i(l + Oi(l)) 

[H], Constr. [ISI 


- ^« 
n, m 


girth 5 


(l + o,(l))^n 
Thm. [m Thm. [20| 


~ ^« 
m 


almost all graphs 


log d+loglog d+Od(l)^ 

Thm. [H Thm. [20| 


- ^« 



Table 1: Summary of the upper bounds for d- regular graphs 

In order to prove our results, we use probabilistic techniques. For some results, we use the weighted 
version of Lovasz' Local Lemma to show the existence of an identifying code, together with the ChernofF 
bound to show that this code is small enough. To bound the number of forced vertices in a graph we study 
an auxiliary directed graph that captures the underlying structure of these vertices. This new technique 
we introduce can be useful to study the number of forced vertices in a more general context, which is 
an important problem in the community of identifying codes. We also make use of other probabilistic 
techniques such as the Alteration Method [1] in order to give better bounds in more restricted cases. 
Finally, we work with the Configuration Model [B] in order to compute the identifying code number of 
almost all random regular graphs. 

The organization of this paper is as follows. In Section [2] we state some preliminary results which will 
be used throughout the paper. In Section[31 we improve the known upper bounds on the identifying code 
number of graphs of maximum degree d. This gives new large families of graphs for which Conjecture [1] 
holds (up to constants). In Section [4l we give an upper bound for graphs having minimum degree S 
and girth at least 5. In Section [Sj we give sharp bounds for the identifying code number of almost all 
d-regular graphs. A further section is dedicated to various constructions of families of graphs which show 
the tightness of some of our results (Section [B]). 

2 Preliminary results 

We first recall a well-known probabilistic tool: the Lovasz Local Lemma. We use its weighted version, a 
particularization of the general version where each event has an assigned weight. The proof can be found 
in [H]. 

Lemma 2 (Weighted Local Lemma [22]). Let £ = {Ei, . . . , Em} be a set of (typically "bad") events such 
that each Ei is mutually independent of £ \ (T>i U {Ei}) where Vi C £. Suppose that there exist some 
integer weights ti, . . . , tM > 1 and a real p < \ such that for each 1 < i < M : 
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• Pr(£^j) < P**, and 

Note that in Lemma [21 since v < \ and (1 — x) > e"^^'"^^)^ in x e [0, 1/2], we have: 

Pr(n -ii ^) > exp {-(2 log 2) Ef£i(2p)** } • (1) 

We also use the following version of the well-known Chernoff bound, which is a reformulation of 
Theorem A. 1.13 in [T]. 

Theorem 3 (Chernoff bound I]). Lei \ be a random variable of n independent trials of probability p, 
and let a > be a real number. Then Pt(X — np < —a) < . 

The following observation gives an equivalent condition for a set to be an identifying code, and follows 
from the fact that for two vertices u, v at distance at least 3 from each other, 7V[u]A7V[i;] — N[u] U N[v]. 

Observation 4. For a graph G and a set C C V{G), if C is dominating and N[u] DC ^ N[v] flC for 
each pair of vertices u, v at distance at most two from each other, then N[u\ nC ^ N[v] nC for each pair 
of vertices of the graph. 

The next observation is immediate, but it is worth mentioning here. 

Observation 5. Let G be a twin-free graph and C, an identifying code of G. Any set C such that C QC' 
is also an identifying code of G. 

The next proposition shows an upper bound on the number of false twins in a graph. 

Proposition 6. Let G he a graph on n vertices having maximum degree d and no isolated vertices, then 
G has at most pairs of false twins. 

Proof. Let us build a graph H on V{G), where two vertices u, v are adjacent in H if they are false twins 
in G. Note that since a vertex can have at most d~l false twins, H has maximum degree d — 1. Therefore 
it has at most "'■'^'"'"^ edges and the claim follows. □ 

Note that the bound of Proposition |6] is tight since in a complete bipartite graph Kd^d, n — 2d and 
there are exactly 2(2) = "^'^^""^^ pairs of false twins. 



3 Upper bounds on the identifying code number 
3.1 Main theorem 

In this section, we improve the known upper bounds of [llj on the identifying code number by using the 
Weighted Local Lemma, stated in Lemma [5] 

In the following, given a graph G on n vertices, we will denote by /(G) the proportion of non-forced 
vertices of G, i.e. the ratio ^, where x is the number of non- forced vertices of G. 

Theorem 7. Let G be a twin-free graph on n vertices having maximum degree d > 3. Then "f'°(G) < 



Proof Let F be the set of forced vertices of G, and V = V{G) \ F. Note that \V'\ = nf{G). By the 
definition of a forced vertex, any identifying code must contain all vertices of F. 

In this proof, we first build a set S* in a random manner by choosing vertices from V' . Then we exhibit 
some "bad" configurations — if none of those occurs, the set C = FU {V' \ S) is an identifying code of G. 
Using the Weighted Local Lemma, we compute a lower bound on the (non-zero) probability that none of 
these bad events occurs. Finally, we use the Chernoff bound to show that with non-zero probability, the 
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size of S is also large enough for our purposes. This shows that such a "good" large set S exists, and it 
can be used to build an identifying code that has a sufficiently small size. 

Let p = p{d) be a probability which will be determined later. We build the set S CV' such that each 
vertex of V independently belongs to S with probability p. Therefore the random variable \S\ follows a 
binomial distribution Bm{nf{G),p) and has expected value Eds'!) = pnf{G). 

Let us now define the set £ of "bad" events. These are of four types. An illustration of these events 
is given in Figure [T] 

• Type A-' (2 < j < d + 1): for each vertex u S V , let be the event that \N[u] = j\ and 
N[u] C S. 

• Type B-' (2 < j < 2d — 2): for each pair {u,v} of adjacent vertices, let ^ be the event that 
\{N[u]AN[v])\ = j and {N[u]AN[v]) C S. 

• Type (3 < j < 2d): for each pair {u, v} of vertices in V at distance two from each other, let 
Ci^ be the event that \{N[u]AN[v])\ =j and {N[u]AN[v]) C S. 

• Type D: for each pair {u, v} of false twins in V' , let Du^v be the event that (A^[u] AA^[w]) = {u, v} C 
S. 

For the sake of simplicity, we refer to the events of type , and as events of type A, B and C 
respectively whenever the size of the symmetric difference is not relevant. 

Events of type „ are not defined since then |7V[M]AA^['i;]| = 1 and F belongs to the code, so they 
never happen. Observe that the events Cl „ and Du^v are just defined over the pairs of vertices in V 
because if either u or w belongs to F, the event does not happen. 

If no event of type A occurs, V{G) \ 5 is a dominating set of G. If no event of type B occurs, all pairs 
of adjacent vertices are separated by V{G) \ S. If no event of type C or D occurs, all pairs of vertices 
at distance 2 from each other are separated. Thus by Observation 21 if no event of type A, B, C or D 
occurs, then V{G) \ 5 is also a separating set of G, and therefore it is an identifying code of G. 




(c) Event Ci,„ (d) Event Du, 



Figure 1: The "bad" events. The vertices in dashed circles belong to set S. 

Let V{Ei) denote the set of vertices that must belong to set S so that Ei holds (see Figure [H where 
the sets V{Ei) are the ones inside the dashed circles). We will say that a vertex v S V{G) participates 
to Ei, if u e V{Ei). We define the weight ti of each event Ei e £ as |T^(£'i)|. For j > 2 and for 
T e {A^ ,B^ ,D}, let tr be the weight of an event of type T (for an event £'j e £ of type T, U = tr). 
We have the following: 
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Some vertex x can participate to at most c? + 1 events of type A since if it participates to some 
event A^^, then u G -/V[x]. Vertex x can participate to at most d{d — 1) events of type B: supposing 
X € V{Bl^^) and u is adjacent to x, there are at most d ways to choose u, and at most d — 1 ways 



to choose V among N{u) \ {x}. Observe that ii x = u or x — v, then x ^ V{Bu.v) (see Figure 1(b)). 
Similarly x can participate to at most d^(d— 1) events of type C: for some event there are at most 
d{d — 1) possibihties li x = u ot x = v and at most did — 1)^ if u or w is a neighbour of x. Finahy, x can 
participate to at most d — 1 events D^^v since x can have at most d — 1 false twins. For each type T of 
events (T S {A-', i?^ , C-' , D}) and any vertex v € V^CG), let us define g(t;, T) to be the number of events 
Ei of type T such that v € F(£'i). Hence: 

d+l 2d-2 

^g(«,A^)<d+l 5^ g(i;,i?^) <d(d-l) 

(^) 

^g(«,C^)<d^(d-l) g(«,i^)<d-l 
i=3 

Let us call Eiq the event that no event of £ occurs. Using the Weighted Local Lemma, we want to 
show that Pr(£'ic) > 0. Given two events Ei and Ej of E, we note i ^ j li V{Ei) n V{Ej) = 0. Observe 
that for any event Ei and any set T C { j : j'}, we have Pr(£'i | rij^rEj) — Pr{Ei), since the vertices 
are included in S with independent probabilities. This means that Ei is mutually independent from the 
set of all events Ej for which V{E,) nV{Ej) = ?l. 

In order to apply the Weighted Local Lemma (Lemma [2]), the following conditions must hold for each 
event Ei & £: 

The latter conditions are implied by the following ones (for each event Ei ^ £): 

d+l 2d-2 

^ J2 g{v,A^){2pY-^ + J2 E g(«,S^)(2p)*-^ + 

j=2veV{Ei) j=2 veViE,) 

2d 

E E s{v,Cn{2pYo^+ E siv,D){2pr" <j 

j=3v<£V{Ei) veV{Ei) 

Which are implied by: 



d+1 1 I 2d--2 



U- max g{v,Ai){2pY^^ } +t,- max { giv, B^){2pY^' } + 

U ■ max J V g(«, C^)i2pYc- \+U- max {g(,;, i?)(2p)*°} < | 
i)ey(Bi) I I vev{Ei) 2 

Using the bounds of Inequalities ([2]) and noting that for p < 1/4 and any j, {2pY^^ < {2pY, {2pY^^ < 
{2pY and (2p)*cj < (2p)'^, for any event i?.; this equation is implied by: 

{d + l){2pY + d{d - l){2pY +d^{d- l){2pf + {d- l){2pY = Ad^p^ + 8dV + 4^ - 8dV < ^ (3) 

Hence, we fix p = where fc is a constant to be determined later. Equation ([3|) holds for k > 3.68 
for all d> 3. In fact, in the following steps of the proof, we will assume that k > 30, and so Equation ([3]) 
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will be satisfied for any d > 3. Since p < j and Pr(iJi) < p*' by the definition of ti and the choice of S, 
the Weighted Local Lemma can be applied. 

Let Mt be the number of events of type T, where T e {A^ , , . By Lemma[5]we have: 

d+lM^j 2d-2Mgj 2d M^j Mo 

pr(£;ic) > n n (1 - (2p)*-) n n (1 - (2^)*-) n n - (2p)*-o n(i - (2^^)*") 

j=2 i=l j=2 1=1 j=3 4=1 i=l 

Note that X]j=2 ^'-^Ai — ^f{G) since by definition there exists exactly one event for each vertex of 
u € V' . Moreover, Y^^=2^ Mgj < ^ since there is exactly one event type ^ for each edge uv e E{G) 

and at most ^ edges in G. We also have that X]j=3 -^cj is at most the number of pairs of vertices in V' 
at distance 2 from each other. This is also at most the number of paths of length 2 with both endpoints 
in V', which is upper-bounded by , Finally, Mo is the number of pairs of false twins in V , 

which is at most nf(G)^^ by Proposition [51 Hence, we have: 

nd nf(G)d(d-l) n/(G)(d-l) 

Pr(i?ic) > (1 - (2p)2)"/('^)(l - {2pf)-{l - {2pf) -2 (1 - {2pf) -2 

Using Lemma [2] (more precisely, we use Equation ([T])) and the fact that p — wc obtain: 

Since /(G) < 1 and it is assumed that k > 30, one can check that for any d > 30 

r 1641og2 1 

The Weighted Local Lemma shows that S has the desired properties with probability Pr(_Eic) > 0, 
implying that such a set exists. Note that we have no guarantee on the size of S. In fact, if = then 
V{G) \ S = V{G) is always an identifying code. Therefore we need to estimate the probability that \S\ 
is far below its expected size. In order to do this, we use the Chernoff bound of Theorem |3] by putting 
a = "-^Jj^^ where c is a constant to be determined. Let -Ebig be the event that \S\ — np > — ""^^^"^ ■ We 
obtain: 



Pr(£;BiG) < cxp ■ 



cd ) 

2pnf{G) 



/ fc/(G) 



Now we have: 



Pr(SiC and ^big) = 1 - Pr(£;ic or ^big) 



> 1 - Pr(£;ic) - Pr(SBiG) 



= l-(l-Pr(Sic))-Pr(^BiG) 

= Pr(i;ic) - Pr(:Ei^) 

r 164 log 2 \ f kf{G) 

> exp < n > — exp ■ 



I5k^d ^ 2c^d 



^Note that this bound could be strengthened by assuming d to be large enough. Indeed, here the term ' can be as 
high as ^ when d = 3 and /(G) = 1, but can be chosen to be as low as desired by assuming d to be larger. However we 
aim at giving a bound for any d> 3, hence we use the weaker bound presented here. 



7 



Thus, PrfSic and £;big) > if c < We (arbitrarily) set c = ''''Zif^T in order to fulfill 

' ^ ^ / 328 log 2 ^ ^ \/221og2 

V 15 

this condition. 

Now we have to check that i?BiG implies that S is still large enough. 



1^1 > E{\S\)-^ 

nf(G) nf{G) 
kd cd 



1 y22bP ^ 7i/(G) 



k fc3/2/(G)l/^ 

Since jS"! must be positive, from Equation (jl]) we need k-^/"^ f{Gy^^ > -y22Tog2/c, which leads to 
k = for flo > 22 log 2. Using all our previous assumptions, by derivating the expression of \S\, one 

can check that 1 5*1 is maximized when qq = Si^el^ Hence we set k — ^2f(G) • 

Remark that under this condition and since /(G) < 1, we have fc > 34 and our assumption following 
Equation ^ that k > 30, is fulfilled. 

Now, with flo = ^i^^, we can see that: 

,^,^^1 l\nf{G) _ al^'-Vm^f{G)' 2 f{Gf ^ f{Gf 

' '"U cj d " ^3/2 "" 2971og2 d 103d 

Hence finally the identifying code C = V \ S has size 

|C|<^ 



n/(G)2 



103d 

□ 

Note that for regular graphs, /(G) = 1 because a forced vertex implies the existence of two vertices 
with distinct degrees. We obtain the following result: 

Corollary 8 (Graphs with constant proportion of non- forced vertices). Let G be a twin-free graph on n 
vertices having maximum degree d > 3 and f{G) — ^ for some constant a > 1. Then ^'°{G) < n— iq^^'^^ - 
In particular if G is d-regular, j'"{G) < n~ 

The next proposition will be proved in the next subsection. 

Proposition 9. Let G be a graph on n vertices and of maximum degree d. Then f{G) > -j^- 

We obtain the following general result: 

Corollary 10 (General case). Let G be a twin-free graph on n vertices having maximum degree d > 3. 
Then 7-(G) <n~ j,^,^^ = n - 

The next proposition will be proved in the next subsection as well. 

Proposition 11. Let G be a graph having no k-cligue. Then there exists a constant "f{k) depending only 
on k, such that f{G) > 

This leads to the following extension of Corollary |S1 where c{k) < 103j{k)'^: 

Corollary 12 (Graphs with bounded clique number). There exists an integer dp such that for each 
twin-free graph G on n vertices having maximum degree d > do and clique number smaller than k, 
7™(G) < n — ^^^^ for some constant c{k) depending only on k. In particular this applies to triangle-free 
graphs, planar graphs, or more generally, graphs of bounded genus. 

We remark here that the previous corollaries support Conjecture [1] They also lead us to think that 
the difficulty of the problem lies in forced vertices. 
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3.2 Bounding the number of non-forced vertices: proofs 

In this section, we prove the lower bounds for function /(G) of the statement of Theorem [71 

The foUowing lemma was first proved in [4 , and a proof can be found in [11] (as ^ is not accessible). 

Lemma 13 ( 4 ). If G is a finite twin- free graph without isolated vertices, then for every vertex u of G, 
there is a vertex v € N[u\ such that V{G) \ {v} is an identifying code of G. 

We recall the statement of Proposition [9] 

Proposition. Let G be a graph on n vertices and of maximum degree d. Then f{G) > -jzfi- 

Proof. Observe that a vertex w of G is not forced only if V{G) \ {v} is an identifying code of G. Hence, 
by Lemma [T3l the set S of non-forced vertices is a dominating set of G, and thus l^l > ^j^. □ 

Note that Proposition [9] is tight. Indeed, consider the graph Ak on 2k vertices defined in ll[ as 
follows: V{Ak) = {xi, . . . , X2k} and E{Ak) = {xiXj, \i — j\ < k — 1}. Ak can be seen as the {k — l)-th 
power of the path P2k- In the graph A^ with an additional universal vertex x (i.e. x is adjacent to all 
vertices of Ak), one can check that all vertices but x are forced. This graph has n = 2k + 1 vertices, 
maximum degree 2k and exactly 1 = non-forced vertex. Taking all forced vertices gives a minimum 
identifying code of this graph. 

However, note that since for a fixed even value of d, we know only one such graph, it is not enough 
to give a counterexample to Conjecture [TJ Indeed in this case the size of the code is n — 1 = n — = 
~ ^~ nkj ~ ^ ^ 1 ^ ^- So we ask the following question: 

Question 14. Does there exist a value of d such that for an infinite number of values of n there exists 
a graph on n vertices of maximum degree d having exactly -jp^ non-forced vertices ? 

Answering this question in positive would provide counterexamples to Conjecture [1] Note that for 
the similar question where we replace d -I- 1 by d, the answer is positive by Construction [23] of Section [6] 
For any d, this construction provides arbitrarily large graphs having exactly non-forced vertices. 

Observe that graph Ak contains two cliques of k vertices. In fact, we can improve the bound of 
Proposition [H] for graphs having no large cliques. Let us first introduce an auxiliary structure that will 
be needed in order to prove this result. 

Let G be a twin-free graph. We define a partial order ^ over the set of vertices of G such that u < v 
if N[u] C N[v]. We construct an oriented graph 'H(G) on V{G) as a subgraph of the Hasse diagram of 
poset {V{G), :<). The arc set of H{G) is the set of all arcs utt where there exists some vertex x such that 
N[v] — N[u] U {x}. Then x is uw-forced, and we note x = f(ui). For a vertex v of V{G), we define the 
set F{v) as the union of v itself and the set of all predecessors and successors of v in 'H(G). Observe that 
'H{G) has no directed cycle since it represents a partial order, and thus predecessors and successors are 
well-defined. 

Lemma 15. Let G be a graph having no k-clique. Then for each vertex u, \F[u)\ < l3(k), where P{k) is 
a function depending only on k. 

Proof. First of all, we prove that the maximum in-degree of "HiG) is at most 2k — 3, and its out-degree 
is at most k — 2. 

Let u be a vertex of G. Suppose u has 2k — 2 in- neighbours in 'H(G). Since for each in- neighbour v of 
w, |A^[u]AAf[i>]| = 1 in G, each of them is non-adjacent in G to at most one of the other in-neighbours (in 
the worst case the in-neighbours of u induce in G a clique of 2fc — 2 vertices minus the edges of a perfect 
matching). Hence they induce a clique of size at least fc — 1 in G. Together with vertex u, they form a 
fc-clique in G, a contradiction. 

Now suppose u has fc — 1 out-neighbours in 'H{G). Since for each out- neighbour u of u in 'H{G), 
N[u\ C N[v\ in G, u and its out-neighbours form a fc-clique in G, a contradiction. 

Now, consider the subgraph of H{G) induced by F{u). We claim that the longest directed chain in 
this subgraph has at most fc — 1 vertices. Indeed, all the vertices of such a chain are pairwise adjacent in 
G. Since G is assumed not to have any fc-cliques, there are at most fc — 1 vertices in a directed chain. 

Finally, we obtain that F{u) has size at most /3(fc) — ^i^Qi2k — 3)' and the claim of the lemma 
follows. □ 
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We now need to prove a few additional claims regarding the structure of T-L{G). In the following 
claims, we suppose that G is a twin-free graph. 

Claim A. Let s be a forced vertex in G with s = f{ut)) for some vertices u and v. If t is an in-neighbour 
of s in T-LiG), then v = f{ts). Moreover if v is forced with v = f{x^), then necessarily y — s. 

Proof. For the first implication, suppose s has an in-neighbour t in 'H{G). An illustration is provided in 
Figure [21 Since u '/j s, then u '/^ t. Moreover v ^ t since s = f{ui). Since s ^ v the claim follows. For 
the other implication, suppose there exist two vertices x,y such that v ~ f{xt)). Hence y ^ v but a; / w. 
Therefore u 'jj x (otherwise v would be adjacent to x too) and hence u ^ y. Now the only vertex adjacent 
to V but not to M is s, so y = s. □ 



V = fits) ^s- = fiui,) 



u t 



Figure 2: The situation of Claim Arcs belong to T-L{G). Full thin edges belong to G only, dashed edges 
are non-edges in G. 



Claim B. Let s be a forced vertex in G with s — f{ui) for some vertices u and v. Then s has at most 
one in-neighbour in 'H{G). 

Proof. Suppose s has two distinct in-neighbours t and t' in T-L{G) (see Figure [3] for an illustration). By 
Claim [XI V is both fs-forced and t's-forced. But then N[t] — N[s] \ {v} = N[t']. Then t and t' are twins, 
a contradiction since G is twin-free. □ 




Figure 3: The situation of Claim [B] Arcs belong to ^{G). Full thin edges belong to G only, dashed edges 
are non-edges in G. 



Claim C. Let s be a forced vertex in G with s = f{ui), and let t be a forced in-neighbour of s in H^G) 
with t = f{x^) for some vertices u, v, x, y. Then x = v. 

Proof. Since t ^ y, then s ^ y too. But since t — f{xtj), x ^ s and x ^ t. Now by Claim [A} v — f(ts), 
that is, v is the unique vertex such that v is adjacent to s, but not to t. Therefore x = v. □ 

We now obtain the following lemma using the previous claims. 

Lemma 16. Let s be a non-isolated sink in 'H(G) which is forced in G with s = f{ui) for some vertices 
u and v. Then either s has a non-forced predecessor t in 'H{G) such that F{s) C F{t), or there exists a 
non-forced vertex w{s) such that F{s) C Ng[w{s)]. Moreover, if there are £ additional sinks {si, . . . , si} 
which are all non-isolated in 'H(G) and such that w{s) — w{si) = . . . — w{si), then there exists a set of 
£ -\- 1 distinct vertices inducing a clique together with w{s). 
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Proof. First of all, recall that T-L{G) has no directed circuits. Suppose s has a non- forced predecessor 
in H{G) and let t be one such predecessor having the shortest distance to s in H{G). By Claim iBl 
predecessors of s are either successors or predecessors of t, and there is a directed path from t to s in 
H{G). Hence F{s) C F{t), which proves the first part of the statement. 

Now suppose all predecessors of s = f{ut)) are forced. By Claim [Bl s and its predecessors form a 



directed path {tp, . . . ,tm,s} in 'H(G) (for an illustration, see Figure 4(a)). Note that by Claim [A} we 
have V — f{t,ns). By our assumption we know that tm is forced, say tm = fixvm) for some vertices x 
and Vm- But now by Claim [Cl x — v and ~ f{vu^). Now, repeating these arguments for each other 
predecessor of s shows that there is a directed path {u, d, Dm, . . . , uq} with t„i = f{vu^) and for all i, 
Q<i<m — l,ti = f{vi+iv\). In particular, <o = /(f iWo)- Observe also that for all i > 1, vi = /(U-iti). 
By applying Claim [Cl on vertices vi^vq and to, if vq is forced then to has an in-neighbour in 'H{G), a 
contradiction — hence is non-forced. Moreover note that since vq ~ to, then wq is adjacent to all 
successors of to in ^(G), that is, to aU elements of F{s). Therefore, putting w{s) — wq, we obtain the 
second part of the statement. 

For the last part, suppose there exists a set of £ additional forced sinks {si, . . . , s^} which are non- 
isolated in ?^(G) and such that all their predecessors in 'H(G) are forced with w{si) — vq ior 1 < i < I 



(for an illustration, see Figure 4(b) I. For each such sink Si, by the previous paragraph, the vertices of 
F{si) induce a directed path {tg, . . . , t\^.,Si} in ■H(G). Moreover we know that there is a vertex Xi such 
that tp is XiVo-iorced. We claim that the set of vertices X = {xi, . . . ,xi} together with and wi, form 
a clique in G of ^ -I- 2 vertices. 

We first claim that for all «,j in Xi ^ tg. If i = j, this is clear by our assumptions. 

Otherwise, suppose by contradiction, that Xi = ig for some i ^ j in {1,...,^}. Then we claim that 
Xj = Iq. Indeed, by the previous part of the proof, we know that /{xJvq) = tg ~ Xi — hence Xj ^ Xi. 
But since XiVp is an arc in 'H{G), we must have f{xiVo) = Xj. Again, we know that f{xiVo) = tg, hence 
Xj = Iq. Let t\ denote the successor of in the directed path from to Si in H{G). We know from the 

previous part of the proof that /(tgi^) = a^i = ^o- However since ig = Xj we also know that /(tgWo) — x^. 
This implies that Nq[vo] = A^g[^i], ^ contradiction since these two vertices are distinct and G is twin-free. 

Now, observe that the vertices of X must all be pairwise adjacent. All vertices of X are adjacent to 
vo, and for each Xi, N[vo] — N[xi] U {<g}, hence Xi is adjacent to all neighbours of vq except tg. But by 
the previous paragraph, we know that tg ^ xj for all j S {1, . . . ,^}, hence Xi is adjacent to all Xj ^ Xi, 
j G {1, ...,£}. For the same reason, each Xi is adjacent to vi. Hence, the vertices of X form a clique 
together with vq and vi. 

Finally, let us show that all the vertices of X are distinct: by contradiction, suppose that Xi — Xj for 
some i^j,l<iTj< £■ Since tg is XitiQ-forced and tg is ij-tiQ-forced, we have tg = tg. Since Si and Sj are 
distinct, this means that Sj and Sj have one predecessor in common. Hence their common predecessor 
which is nearest to Si and Sj, say t, has two out-neighbours. Let t^ (respectively tj) be the out-neighbour 



of t which is a predecessor of Si (respectively sj) — see Figure 4(c) for an illustration. We know that 



there are two vertices yi,yj such that yi — f{tti) and yj — f{ttj). First note that yi and yj are distinct: 
otherwise, we would have N[ti] = N[t] U {yi} ~ N[t] U {yj} ~ N[tj] and then ti,tj would be twins in G. 
Observe that since t ^ yi and yi ^ f{ttj), we have tj / yi. We know that t is forced, in fact by the first 
part of this proof, we also know that t = f(jjiz\) for some vertex Zi. Hence Zi ~ t, and since N[t\ C N[tj], 
Zi ~ tj. But since tj ^ f{yiz\), tj ~ y^, a contradiction. Hence Xi and Xj are distinct, which completes 
the proof. □ 

Finally, let us recall and prove Proposition [TT] 

Proposition. Let G be a graph having no k-clique. Then there exists a constant 7(fc) depending only on 
k, such that f{G) > i:^- 

Proof. To prove the result, we use 'H(G) to construct a set X = {xi, . . . , Xi} of non-forced vertices such 
that IJi=i ^i^i) = ^(C), where A{xi) is a set of at most 7(fc) vertices. Then we have £ > vertices 
in X and the claim of the proposition follows. 

We now describe a procedure to build set X while considering each non-isolated sink of H{G). We 
denote by s the currently considered sink. 
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(c) If Xi = Xj, the dotted edge yitj is both 
an edge and a non-edge of G 

Figure 4: Three situations in the proof of Lemma [T51 Arcs belong to H{G). Full thin edges belong to G 
only, dashed edges are non-edges in G. 
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Case 1: Sink s is non-forced. Then we set A{s) to be F{s) together with ah the vertices which are 
forced by a pair u,v of vertices of F{s). Note that by Lemma [T51 1-^(5)1 < f3{k), where /3(fc) only depends 
on k. Hence, \A{s)\ < P{k) + C^^^'^^). 

Case 2: Sink s is forced. By LemmafTCl either s has a non-forced predecessor t such that F{s) C F{t), 
or there exists a non-forced vertex w{s) such that F{s) C NqIw]. 

In the first case, we choose t as our non-forced vertex, and we set A{t) to be F{t) together with all 
the vertices which are forced by a pair u, v of vertices of F{t). Again we have \ A{t)\ < l3{k) + {^^2'') ■ 

In the second case, we choose w = w{s) as our non-forced vertex. Now, let S — {s,si, . . . ,se} be 
the set of forced sinks having no non-forced predecessor and such that w{s) = ly(si) = . . .w{st.). By 
Lemma [T51 we know that there are distinct vertices inducing a clique together with w, hence i+2 < k. 
We set A{w) to be F{w) U F{s) U F{si) U . . . U F{se) together with all the vertices which are forced by a 
pair u,v of vertices of this set. We have \ A{w)\ < kl3{k) + C''^^''^)- 

We have now covered all the vertices which are not isolated in T-L{G), since for each non-isolated sink 
s of T-l{G), F{s) is a subset of A{x) for some x G X. Moreover all isolated vertices of H{G) which are 
forced, have also been put into some set A{x). Hence only non- forced isolated vertices of 7^(G) need to 
be covered. For each such vertex v, we add v to X and set A{v) = {v}. 

Finally, all vertices belong to some set A{x), x G X, and the size of each set A{x) is at most 
7(fc) = k(3{k) + i'"^^''''), which completes the proof. □ 



4 Upper bounds for graphs with girth at least 5 

This section is devoted to the study of graphs that have girth at least 5. We will use these results in 
Section [21 which deals with random regular graphs. 

Despite being different than our previous proofs, the ones of this section have also a probabilistic 
flavour. One can check that for graphs of girth 5, applying the Local Lemma does not lead to a satisfying 
result. However, by using the Alteration method, a better bound can be given. 

We start by defining an auxiliary notion that will be used in this section. A subset D C V{G) is 
called a 2-dommating set if for each vertex v of V{G) \ D, \N{v) n D| > 2 [9]. The next lemma shows 
that we can use a 2-dominating set to construct an identifying code. 

Lemma 17. Let G be a twin-free graph on n vertices having girth at least 5. Let D he a 2-dominating 
set ofG. If the subgraph induced by D, G[D], has no isolated edge, D is an identifying code of G. 

Proof. First observe that D is dominating since it is 2-dominating. Let us check that D is also separating. 

Note that all the vertices that do not belong to D are separated because they are dominated at least 
twice each and g{G) > 4. 

Similarly, a vertex x € D and a vertex y € V{G) \ D are separated since y has two vertices which 
dominate it, but they cannot both dominate x (otherwise there would be a triangle or a 4-cycle in G). 

Finally, consider two vertices of D. If they are not adjacent they are separated by themselves. 
Otherwise, by the assumption that G[D] has no isolated edge and that G has no triangles, we know 
that at least one of them has a neighbour in D, which separates them since it is not a neighbour of the 
other. □ 

The following theorem makes use of Lemma [T7l The idea of the proof is inspired by a classic proof of 
a result on dominating sets which can be found in the first chapter of [1]. 

Theorem 18. Let G be a graph on n vertices with minimum degree 6 and girth at least 5. Then "f'°{G) < 
(l + 05(l))^^n. Moreover if G has average degree d = Os{6{\ogS)'^) then^^iG) < ^°sS+iogiogS+Os{i) 

Proof. Let S C V{G) be a random subset of vertices, where each vertex v G V{G) is added to S uniformly 
at random with probability p (where p will be determined later). For every vertex v G V{G), we define 
the random variable X„ as follows: 



X„ 



if \N[v] n S"! > 2 

1 otherwise 
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Let T = {v \ Xy = 1}. This set contains, in particular, the subset of vertices which are not 2- 
dominated by 5*. Note that |T| = J^^v- Let us estimate the size of T. Observing that \N[v] n S'| '-^ 
Bin(deg(w) + and deg(w) > 6, we obtain: 

E(|r|) = ^(^") 

v<£V{G) 

< n {{1 ~ p)^+^ + {S + l)p{l - pY) 
= n{l^p)'iil-p) + {d + l)p) 

< n{l + Sp)e-^P. 

where we have used the fact that 1 — x < e^^. Now, note that the set _D = S* U T is a 2-dominating set 
of G. We have \D\ < \S\ + \T\. Hence 

E{\D\) < E{\S\)+E{\T\) 

< np + n{l + 5p)e~^P (5) 

Let us set p = '°g '^+'°g '°g ^ . Plugging this into Equation (O, we obtain: 

Ti^nnh ^ + , 1 + log^ + loglogJ Iog(5 + Ioglog5 + O5(l) 
E{\D\) < n + n = n 

This shows that there exists at least one 2-dominating set D having this size. 

Case 1: (general case) Note that we can use Lemma [T71 by considering all pairs u, v of vertices of D 
forming an isolated edge in G[D], and add an arbitrary neighbour of either one of them to D. Observe 
that such a vertex exists, otherwise u and v would be twins in G. Since there are at most such pairs, 

we obtain a 2-dominating set of size at most \D\ + = (1 + os{V))^^^^n having the desired property. 
Now applying Lemma [TTl completes Case 1. 

Case 2: (sparse case) Whenever d = 05((5(log (5)^), we can get a better bound by estimating the 
number of isolated edges of G[Z3]. For convenience, we define the random variables Y^v for each edge uv 
of G, as follows: 

1 \i N[u]/:\N[v](ZV{G)\S 
otherwise 



Y 

J II.'. 



An isolated edge in G[D] might have been created in several ways. First, at the initial construction 
step of S: if both u, v belong to 5, but none of their other neighbours do which happens with probability 
at most p^{l — p)"^^^^- A second possibility is in the step where we add the vertices of T to our solution. 
This could happen if both u, v were not dominated at all by S, which occurs with probability at most 
(1 — p)^^ , or if exactly one of u, v was part of S and none of their neighbours were, which has probability 
at most 2p{l — p)^^~^. Thus, the total probability of having an isolated edge in G[D] is bounded from 
above as follows. 

Pr(K. = !)</(!- pf'-' + (1 - pf + Ml - pf'-' = (1 - pf'-' 
Using the previous observation together with the facts that p — '"s ■^+iog log ^ g^-^^j 1 — x < e^^, let us 
calculate the expected value of y = ^uveE{G) ^^-v- 

rr., . V- , . nd, ^9A 9 f9A 9Vn ^de" ^ ('"g *+i°g '"g ud 
E(r)= E nYu.)<^{l-pr-'<^e-^''-'^^^ -^^1(1^ 

We construct U by picking an arbitrary neighbour of either u or v for each edge uv such that Yuv = 1- 
We have \U\ < Y. The final set C = S" U T U C/ is an identifying code. Now we have: 

E(|C|) < E(l^l) + E(|r|) + Eim < ^"''^ + ^'^'^ ^ ' ^ 



2(52 (log ,5)2 
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Using that d = OsiS{\ogS)^), 



E(|C|) < 



log (5 + log \og6 + Os{l) 
S 



■n 



(6) 



Then there exists some choice of S such that \C\ has the desired size, and completes the proof. □ 

In fact, it is shown in the next section (Corollary [22]) that Theorem [TSl is asymptotically tight. 

Moreover, note that Theorem [18] cannot be extended much in the sense that if we drop the condition 
on girth 5, we know arbitrarily large d-regular triangle-free graphs having large minimum identifying 
codes. For instance. Construction [25] of Section [6] provides a graph G which satisfies 7™(G) — n — 2- 
Similarly, we cannot drop the minimum degree condition. Indeed it is known that any (d— l)-ary complete 
tree Td^h of height h, which is of maximum degree d, minimum degree 1 and has infinite girth, also has 
a large identifying code number (i.e. Y°{Td.h) = n — d-i+od(i) El^' 

5 Identifying codes of random regular graphs 

From the study of regular graphs arises the question of the value of the identifying code number for most 
regular graphs. We know some lower and upper bounds for this parameter, but is it concentrated around 
some value? A good way to study this question is to look at random regular graphs. 

Consider the Configuration Model, where a d-regular multigraph on n vertices is obtained by selecting 
some perfect matching of Knd at random (see [S] for further reference). We will only consider cases where 
nd is even, as otherwise there does not exist any d-regular graph on n vertices. In the Configuration 
Model, the set of vertices in Knd is partitioned into n cells of size d and each cell Wy is associated to a 
vertex v of the random regular graph. An edge e of a perfect matching of Knd induces either a loop in 
V (if it connects two elements of W^) or an edge between v and u (if it connects a vertex from to a 
vertex in Wu)- 

In general, this model may produce graphs with loops and multiple edges. We will denote by G*{n, d) 
the former probability space and by Q{n, d) the same probability space conditioned on the event that G 
is simple. It is shown in [2j that the following holds: 



Thus, for constant d any property that holds with probability tending to 1 for G*{n,d) as n — ^ oo, will 
also hold with probability tending to 1 for Q{n,d). In this case we will say that the property holds 
with high probability (w.h.p.). In fact our bounds include asymptotic terms in d, which means they are 
meaningful for sufficiently large d. 

Theorem 19. Let G G g{n,d) then for any d>3, Y"{G) < ^°sd+iogiogd+OAi) ^ w.h.p.. 

Proof. First of all we have to show that almost all random regular graphs are twin-free. 

Observe that the number of perfect matchings of is (2r7i — 1)!! — (2m — l)(2m — 3)(2m — 5) ... 1. 
Fix a vertex m of G and let N{u) = {wi, . . . We bound from above the probability that u and Vi 

are twins, i.e. N[u] — N[vi\. The number of perfect matchings of Knd such that in the resulting graph 
G of Q{n, d), vi and V2 are adjacent, is at most {d — l){d — l){nd — 2d — 3)!!. Indeed, there must be an 
edge between vi and W2, which gives {d— l){d— 1) possibilities. Since u has d neighbours, the number of 
possibilities for the remaining graph is the number of perfect matchings of Knd-2d-2- 

Analogously the number of perfect matchings with W2,W3 € N{vi) is at most [d — l)((i— 1)(<^— 2)(c? — 
l){nd -2d- 5)!!. Thus we have: 



Pr(Geg(n,d) \ Geg*{n, 



d)) =(l + o(l))e— 



if d = o{^/n). 
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Pr{N[u] = N[vi]) < Pr(7V[u] C N[vi]) 

{d- l){d l){d - 2){d - 1) . . . 2(d - l)l{d - l){nd - 4d + 1)!! 
(nd - 2d - 1)!! 

d'^-^{d-l)\ 



< 



(nd - 2d - 1) . . . (nd - 4d + 3) 

(dV-^ 

< — I for n large enough. 



As we have at most ^ possible pairs of twins (one for each edge), by the union bound and since 
d > 3, for sufficiently large n we obtain: 



nd f d^ ^ 
Pr(G has twins) < — I — 

which tends to as n tends to infinity. 

Therefore, random regular graphs are twin-free w.h.p. 
By for any G E Q{n,d), we have a set C with 

< logd + loglogd + Orf(l) ^^ 
~ d 

that separates any pair of vertices except from the ones where both vertices belong to a triangle or a 
4-cycle. We have to add some vertices to C in order to separate the vertices of these small cycles. 

Classical results on random regular graphs (independently, [6j Corollary 2.19] and [28]) state that the 
random variables that count the number of cycles of length k, X^, tend in distribution to independent 
Poisson variables with parameter A^, — l)*"- 

Observe that: 

IE(^3) ^ g ^ EiXi)^^-^ 

i.e. a constant number of triangles and 4-cycles are expected. 

Using Markov's inequality we can bound the probability of having too many small cycles: 

Pr(X3 > t) < Pr(X, > t) < 

Setting t — i?(n), where i9(n) — ;> cxd, the previous probabilities are o(l). Then w.h.p., we have at most 
t?(n) cycles of length 3 and i?(n) cycles of length 4. 

Let T = {ui,U2,U3} be a triangle in G. As d > 3 there exists at least one vertex Vi outside the 
triangle (moreover, we showed that the graph has no twins w.h.p.). Since our graph is twin-free, for each 
ordered pair (ui,Uj) there exists some vertex Wy , such that Vij € N(ui)\N{uj). Observe that we can add 
Vi2, V23 and V31 to C and then any pair of vertices from T will be separated. 

If r = 1*2, U3, U4} induces a K4, each pair of vertices of T is contained in some triangle and 
is separated by the last step. If T induces a 4-cycle, adding T to C separates all the elements in T. 
Otherwise, T induces two triangles and adding T to C separates the two vertices which have not been 
separated in the last step. 

After these two steps, we have added at most 7d{n) vertices to C. Hence, for any G E G{n,d) w.h.p. 
we obtain: 

lutr-^ ^ '^osd + log\ogd + Od{l) x logd + loglogd + 0<j(l) 

7 (Gj < ; n + 7v[n) = ; n 

d d 

Observe that the ^^^jp-n term contains the 7'd{n) term. □ 
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Theorem shows that despite the fact that for any d, we know infinitely many d-regular graphs 
having a very large identifying code number (e.g. n — ^ for the graphs of Construction [Ml of Section [5]), 
almost all d-regular graphs have a very small identifying code. 

Moreover, 7™(G) is concentrated, as the following theorem and its corollary show. In fact the following 
result might be already known, since a similar result is stated for independent dominating sets in [17) . 
However we could not find it in the literature and decided to give a proof for the sake of completeness. 

Theorem 20. Let G G Q{n, d), then w.h.p. all the dominating sets ofG have size at least i°g ^°^'^ n. 

Proof. We will proceed by contradiction. Given a set of vertices D of size to, we will compute the 
probability that D dominates Y = V{G) \ D. Recall that G has been obtained from the configuration 
model by selecting a random perfect matching of Knd- Let y & Y fixed, then let Ay = {N{D) n {y} ^ 0} 
be the event that y is dominated by D. Its complementary event corresponds to the situation where 
none of the edges of the perfect matching of Knd connects the points corresponding to y to the ones 
corresponding to any vertex of D. Define Wd = ^veoWy as the set of cells corresponding to D in Knd- 
Then for any v £ Wd, the event By corresponds to the fact that v is not connected to any point in Wy. 

If Wd = {Wl, . . . ,Vrad}-, 



Pr(Ay) = Pr(n„ew'oS„) 

= Pr(i3,., ) Pr(i?,, I J . . . Pr(i?,„, | f^TJl'B., ) 

1-^-^1 (1-^^|...(1- 



nd ~ 1 J \ nd — 3 J \ nd — (2md — 1) 



md 



> 



nd- {2i-l) 

n i-r^ 



n — 2to 



Since \ -x = e-^+(i°g(i-^)+^) (here we take x = ■^^^) and log(l - x) ^ x = 0(x^) (by the Taylor 
expansion of the logarithm in a; = 0), we obtain: 



- -{-''-<"'^} 

The probability that D is dominating all vertices oiY — {j/i, . . . , yn-ra\ is: 

Pr {^y^yAy) = Pr (A, J Pr {Ay, J . . . Pr (Ay„_ | M,^) 

We claim that Pr {^Ay. \ n*C^^Aj^^. ) < Pr (Aj,. ). Suppose that yi, . . . , yi-\ are dominated. This means 
that the corresponding perfect matching of Knd has an edge between one of the points corresponding to 
j/j (1 < j < « — 1) and one of the points corresponding to the vertices of D. The probability that yi is not 
dominated by D is now the probability that none of the remaining edges of the perfect matching connect 
any vertex of D with yi. Hence: 



Pr(A,, |n}-lA,^)= 1 



> 1 



nd-2i + l 
d 



1 - 



nd — 2i — 1 



d 



nd — 2md + 1 



nd — 1 

=pr(A;:) 



1 



nd — 3 



nd — 2md + 1 
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By considering the complementary events, Pr (Ay. | n^^gAy^) < Pr (Ay.). Hence these events are 
negatively correlated, and: 

"tT / mrf \"-" f nid ) 

Pr (HyeyAy) < II Pr(AyJ < 1 - e "-2m J < exp <^ -(n - m)e »-2m I 
For the sake of contradiction, let m < ^°sd-c\ogiogd ^ ^^^^ ^ ^ 2. Then: 



logd- cloglogd\ j logd-cloglogd 

Pr (riygyAy) < exp < - 1 nexp 



= exp |- (1 + Od(l)) nexp 

(log d)<= 



\ d 

log d — c log log d ~ 



l + Od(l) 



Note that if no set of size m dominates Y , neither will do a smaller one. So we have to look just at 
the sets of size m. The number of these sets can be bounded by 



= (l + o<i(l)) 



log d—c log log d 

de 



log d — c log log d 

log d—c log log d 

de 



logd 

where we have used ml > (^)'"- 

Let Eds be the event that G has a dominating set of size m. Applying the union bound, we obtain: 

log d—c log log d 

= (1 + 0.(1)) exp {i^^i^^^i^dogd + 1 - loglogd)n - 

^ (1 +0.(1)) exp /r(i^^-(i^^+o.ri^^ 



since c > 2. This shows that w.h.p. no set of size less than '"s ^ 2 jog log d ^ dominate the whole graph 
and completes the proof. □ 

Since any identifying code is also a dominating set, we obtain the following immediate corollary. 
Corollary 21. Let G G Gin^d), then w.h.p. Y"{G) > ]2s±zl^ml£sdn. 

Plugging together Theorems fT9l and [20l we obtain the following result. 

Corollary 22. Let G E Q{n,d), then w.h.p. 

log d - 2 log log d ,^ \ogd + log log d + Od{l) 
^ n < 7 (G) < n 

6 Extremal constructions 



This section gathers some constructions which show the tightness of some of our upper bounds. Some 
of these constructions can be found in [10] . 
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Construction 23. Given any dn -regular multigraph H (without loops) on tih vertices, let Ci{H) be the 
graph on n = nnidn + 1) and maximum degree d ~ dn + 1 constructed as follows: 

1. Replace each vertex v of H by a clique K{v) of dn + 1 vertices 

2. For each vertex v of H , let N(v) — {vi, . . . , v^i^} and K{v) = {fco(w), . . . , k^i^ {v)}. For each ki(v) 
but one (1 < i < dn), connect it with an edge in Ci{H), to a unique vertex of K{vi), denoted 

fihiv)). 

One can see that the graphs Ci{H) given by Construction[23]are twin-free. Moreover, for each vertex v 
of H and for each 1 < i < d^j^ note that / {ki{v)) is ko{v)ki{v)-ioT:ceA. Therefore Ci{H) has djfUff = n— 2. 
forced vertices. In fact these forced vertices form an identifying code, therefore 7™(Ci(i/)) — n — 2- An 
example of this construction is given in Figure [SI where H is the hypercube of dimension 3, H^, and the 
black vertices are those which belong to a minimum identifying code of Ci{H^). 




Figure 5: The graphs and Ci(i?3) 



The following construction is very similar, but yields regular graphs. 

Construction 24. fjQ) / Given any dn-regular multigraph H (without loops) on nn vertices, let C2{H) 
be the d-regular graph on n = nndH vertices (where d = dn) constructed as follows: 

1. Replace each vertex v of H by a clique K{v) of dn vertices. 

2. For each vertex v of H , let N{v) = \v\, . . . , Wd^f } and K{v) — {ki(v), . . . , kd^ («)}. For each ki{v) 
(1 < i < du), connect it with an edge in C2{H), to a unique vertex of K{vi), denoted f {ki{v)). 

Note that for some vertex v of H, in order to separate each pair of vertices ki{v), kj{v) of K{v) in 
C2{H), either f {ki{v)) or f {kj{v)) must belong to any identifying code. Repeating this argument for 
each pair shows that at least d — 1 such vertices are needed in the code. Since for any two cliques K{u) 
and K{v), the set of these neighbours are disjoint, this shows that at least nnid— 1) vertices are needed 
in an identifying code of C2{H). In fact it is easy to construct an identifying code of this size. This 
shows that despite the fact that C2{H) has no forced vertices, 7™(C2(i?)) = n — ^. An example of 
this construction is given in Figure [HI where H is the complete graph i^s, and the black vertices form a 
minimum identifying code of C2{K^). 

Construction [23l and [24l are close to Sierpihski graphs, which were defined in [20]. Recently in |16j . 
it has been shown that Sierpihski graphs are also extremal with respect to Conjecture [TJ i.e. for any 
Sierpihski graph G on n vertices with maximum degree d, 7^°(G) = n — ^. 
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Figure 6: The graphs K<^ and C2{K^) 



Construction 25. fiO) / Given an even number 2k and an integer d > 3, we construct a twin-free d-regular 
triangle-free graph C3{2k, d) on n = 2kd vertices as follows. 

1. Let {cq, . . . , C2k-i} be a set of 2k vertices and add the edges of the perfect matching {ciCi+i mod 2fc | 
i is odd}. 

2. For each even i (0 < i < 2k — 2), build a copy K{i) of the complete bipartite graph Kd~i,d-i- 
Join vertex Ci to all vertices of one part of the bipartition of K{i), and join vertex Ci+i to all other 
vertices of K{i). 

Consider an identifying code of C3{2k,d). Note that in each copy K{i) of Kd-i.d-i, at least 2d — 4 
vertices belong to the code in order to separate the vertices being in the same part of the bipartition of 
K{i). Now if exactly 2d — 4 vertices of K{i) belong to the code, in order to separate the two remaining 
vertices, either Ci or c^+i belongs to the code. Hence for each odd i, at most three vertices from {q, Ci+i}U 
V{K{i)) do not belong to a code of C3(2fc, d). On the other hand, taking all vertices Ci such that i is even 
together with d — 2 vertices of each part of the bipartition of each copy of Kd^i.d-i yields an identifying 
code of this size. Hence 7™(C3(2fc, d)) = k + 2k{d — 2) — n — 2j/^- An example of this construction 
is given in Figure [71 where 2fc = 8, c? = 3, and the black vertices form a minimum identifying code of 
C3(8,3). 




Figure 7: The graph €3(8, 3) 
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